fix: replace copy with deepcopy in Mooncake pullbacks #723
Conversation
Codecov Report
Attention: Patch coverage is

@@ Coverage Diff @@
##             main     #723      +/-   ##
==========================================
- Coverage   97.99%   97.98%   -0.01%
==========================================
  Files         121      121
  Lines        6341     6363      +22
==========================================
+ Hits         6214     6235      +21
- Misses        127      128       +1

Flags with carried forward coverage won't be shown. View full report in Codecov by Sentry.
Largely LGTM. Would it make sense to add a regression test which checks that the new copying mechanism works for parameters which are e.g. `Tuple`s / `NamedTuple`s?
I would like to ensure the absence of regression more systematically by using test scenarios which involve tuples or nested structs. One option is to define new ones (#724). Another option would be to implement the necessary standardization so that Mooncake can be tested on Flux and Lux scenarios (no idea how complicated that would be). A third option would be to simply check that Mooncake runs on these neural net scenarios, without ensuring correctness, because comparison to the reference gradient requires standardization. Thoughts?
Hmmm interesting. Is it documented anywhere what standardisations must be applied in order to support Flux / Lux? Additionally, do the two frameworks agree? I'd be keen to get Mooncake tested on them as part of DI / Mooncake's own tests. I don't want to slow down this PR though, so perhaps just merge this, and we can discuss elsewhere?
Not documented, but these standardizations are necessary to compare the gradient returned by different backends. But I've never even tried Mooncake on such scenarios so maybe it does the right thing / agrees with Zygote out of the box. Let's merge this and discuss further in #724 |
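The regression check being floated here has a backend-agnostic shape: compute the gradient twice and verify that the two results do not alias each other's buffers. A hedged Python sketch of that pattern follows; `ToyBackend`, `gradient_unsafe`, and `gradient_safe` are hypothetical names standing in for a real pullback, not DI's or Mooncake's API.

```python
import copy

class ToyBackend:
    """Toy AD backend that reuses an internal tangent buffer between calls."""
    def __init__(self):
        self._buffer = {"weight": [0.0, 0.0]}

    def gradient_unsafe(self, xs):
        # Mutates the cached buffer in place, then returns a shallow copy:
        # the inner list is still shared with the internal cache.
        for i, x in enumerate(xs):
            self._buffer["weight"][i] = 2.0 * x
        return copy.copy(self._buffer)

    def gradient_safe(self, xs):
        # Same computation, but deepcopy isolates the result from the cache.
        for i, x in enumerate(xs):
            self._buffer["weight"][i] = 2.0 * x
        return copy.deepcopy(self._buffer)

def results_alias(grad_fn, xs):
    # Regression check: two successive gradients must not share buffers.
    g1 = grad_fn(xs)
    g2 = grad_fn(xs)
    return g1["weight"] is g2["weight"]

backend = ToyBackend()
print(results_alias(backend.gradient_unsafe, [1.0, 2.0]))  # True: shallow copy leaks
print(results_alias(backend.gradient_safe, [1.0, 2.0]))    # False: deepcopy isolates
```

Such a check would catch aliasing on `Tuple`/`NamedTuple`-shaped parameters without needing a reference gradient, sidestepping the standardization question.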
Replace `copy` with `deepcopy` when returning general tangents, to fix "How to use Mooncake in Lux" (chalk-lab/Mooncake.jl#467). Add a shortcut for numbers and arrays to avoid paying the price of `deepcopy`. Use `Mooncake.value_and_gradient!!` for `DI.gradient`; the same reasoning applies for copying before returning. Add `f::F` everywhere to force specialization.

@willtebbutt does this look good?
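The shallow-versus-deep copy distinction driving this fix can be illustrated with a small Python analogy (a hedged sketch, not the Julia implementation): a shallow copy of a tangent that nests mutable buffers still shares those buffers, so later in-place mutation by the AD engine leaks into the returned gradient, while a deep copy with a fast path for plain numbers and arrays avoids both the bug and the overhead. The function names here are illustrative.

```python
import copy

def return_tangent_shallow(tangent):
    # copy.copy duplicates only the outer container; nested buffers are shared.
    return copy.copy(tangent)

def return_tangent_deep(tangent):
    # Fast path: numbers and flat lists of numbers are cheap to copy directly,
    # avoiding the price of a full deepcopy (the "shortcut" mentioned above).
    if isinstance(tangent, (int, float)):
        return tangent
    if isinstance(tangent, list) and all(isinstance(x, (int, float)) for x in tangent):
        return list(tangent)
    # General case: deepcopy so later in-place mutation cannot leak out.
    return copy.deepcopy(tangent)

# A tangent shaped like a NamedTuple of arrays: a dict holding lists.
tangent = {"weight": [1.0, 2.0], "bias": [0.5]}
shallow = return_tangent_shallow(tangent)
deep = return_tangent_deep(tangent)

tangent["weight"][0] = 99.0  # simulate the AD engine reusing its buffers

print(shallow["weight"][0])  # 99.0 -- mutation leaked through the shallow copy
print(deep["weight"][0])     # 1.0  -- the deep copy is unaffected
```

In Julia the same dichotomy holds between `copy` (shallow) and `deepcopy` (recursive), which is why the shortcut for leaf types like numbers and arrays recovers most of the lost performance.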